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1. Introduction 

Local dependence, when the variables only depend on those others which are in their neigh- 
borhood, has been one of the first examples of Stein's method, see (Chen and Shao, 2004) 
and the references therein. 

The usual form of local dependence is the following (based on (Chen, Goldstein and Shao, 
2011), Chapter 4.7.): 

Definition 1. A group of random variables {Xi,i G A} satisfies {LD1) if for each i G A 
there exists Ai G [n] such that and X^ c are independent. 

An undirected graph Q = ([n], £ ) is called a dependency graph of{Xi, i G ^4} if each Xi can 
only depend on its neighbors in Q (i.e. it is independent of the complement of its neighbors). 
An example for such a graph Q is a graph with edge between i and j if i G Aj or j G Ai (i.e. 
one of them is in the neighborhood of the other). 

We say that {X i: i G [n]} satisfies (LD1, m) if there is a Q dependency graph that has 
maximum degree at most m — 1 . 

Let Q = (V, E) be an undirected graph. The chromatic number of Q, x{G) is the smallest 
positive integer k such that the vertices of Q can be colored with k colors with no edge 
between vertices of the same color. 

(Janson, 2004) shows concentration of sums under (LD1), and shows that Chernoff- 
Hoeffding and Bernstein inequalities also hold for sums of (LD1) dependent variables, with 
constants less than x{S) times weaker than in the independent case. 

The objective of this paper is to investigate whether this result holds for more general 
functions of (LD1) dependent variables. We could extend (Janson, 2004) to subadditive 
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functions. On the other hand, as the following counterexample shows, concentration does 
not holds for arbitrary Hamming-Lipschitz functions: 

Example 1.1. For n even, let Xx, . . . ,X n / 2 ,X[, ...,X' n , 2 be random variables taking values 
1 and -1. Let X!,...,X re/2 be i.i.d. with P(Xi = 1) = P(X { = -1) = 1/2. Let Q be an 
independent random variable with P(Q = 1) = P{Q = —1) = 1/2. Define X- = Q • Xi, 
1 < % < n/2. 

For {Xi, . . . ,X n /2,X[, ...,X' n / 2 } as defined such satisfy (LD1,2). Define the function g : 
M 2 ^Ras g(a,b) = a-b/2, then #(1,1) = #(-1,-1) = 1/2 andg(l,-l) = g(-l, 1) = -1/2. 
Define f(Xi,...,X n / 2 ,X[ 1 ...,X' n ^ 2 ) = Y^i=i9(Xi,X' i ) ) then this f is 1-Hamming Lipschitz 
(depending on n variables). On the other hand, for the distribution we gave to Xi and X[, 
we have g(Xi, X[) = Q, so f{X\, X n / 2 , X[, X', 2 ) = nQ/2, taking values n/2 and —n/2 
with probability 1/2. 

We have looked for examples in the literature about (LD1) dependent random variables, 
and most of them were defined as functions of independent random variables. For such cases, 
as we will show, concentration inequalities hold for general functions. 

1.1. Main definitions 

We will use the fractional chromatic number: 

Definition. Let Q = (V,E) be an undirected graph. The fractional chromatic number of Q , 
X*(G) is the smallest positive real k for which there exists a probability distribution over the 
independent sets of Q such that for each vertex v, given an independent set S drawn from 
the distribution, 

Pr(v eS)>-. 

k 

The independent sets of Q here mean all the subsets of the vertices of Q that contain no 
edges between them. 

(Janson, 2004) introduces these: 

Definition. Given A and {X a }, a G A, we make the following definitions: 

• A subset A' of A is independent if the corresponding random variables {X a } a£ A> (ire 
independent. 

• A family {Aj}j of subsets of A is a cover of A ifUjAj = A. 

• A family {(Aj,Wj)}j of pairs (Aj,Wj) where Aj C A and Wj G [0,1] is a fractional 

cover of A if J2j- a eA w i — ^ f or eac ^ a e 

• A (fractional) cover is proper if each set Aj is independent. 

• x(A) is the size of the smallest proper cover of A, i.e. the smallest m such that A is 
the union of m independent subsets 

• x*(A) is the minimum ofJ2j w j over all proper fractional covers {(Aj,Wj)}j. 

• We say that a fractional cover {(Aj,Wj)}j is exact if ^ . Wjt^ = 1a- 

It is shown in (Janson, 2004) that for (LDl,m) with dependency graph Q, 



X*(A)< X *(G)<x(G)<m. 
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Lemma 3.2 of (Janson, 2004) shows that we can make an exact fractional cover from 
any fractional cover without changing F.^j, thus we can restrict our attention to exact 
fractional covers. 

The first result for sums is the following: 

Theorem 1.1 (Theorem 2.1 of (Janson, 2004)). Suppose that {Xi}^^ satisfies (LD1), ai < 
Xi < hi for i G A and some real numbers cij and bi. Then, for t > 0, Denote S := YlieA-^i; 
then for t > 0, 

n.X > EX + t)< exp (- 2 -^£^_ - ) . (!,) 

The same estimate holds for F(X < EX — t) . 

Further Bernstein- type results are proven in (Janson, 2004), which take into account the 
variance of Xi, and thus give better bounds than (1.2) for sums of random variables with 
small variances, an example is the following: 

Theorem 1.2 (Theorem 2.3 of (Janson, 2004)). Suppose that {Xi} ie ^ satisfies (LD1), Xi — 
KXi < b for some b > 0, and all i G A. Then, for t > 0, Denote S := YlieA ^ ar (^i), then 
for t>0, 

f 8t 2 \ 

F(X > EX + t) < exp — — — — - . (1.2) 

1 " } ~ P V 25 X *(A)(S + bt/3)J 1 ; 

The main idea of the proofs in (Janson, 2004) is to separate ^2 ieA Xi into sums of inde- 
pendent random variables, and use the concentration properties of such sums. 

Our first result, Theorem 2.1, gives an upper tail bound for subadditive functions. The 
proof is based on the same idea as (Janson, 2004). An application is given: an estimate for 
the upper tail of the norm of random matrix with locally dependent entries. 

Example 1.1 made us look for other, stronger definitions of local dependence, that are suffi- 
cient for concentration for large class of functions ((GD) is based on (Chen, Goldstein and Shao, 
2011)). 

Definition 2 (GD). {Xi,i G ^4} satisfies graphical dependence if we can define a graph 
Q = (.4., £) such that for any pair of disjoint sets r^I^ in A such that there is no edge in 
E that has one endpoint in Y\ and another in T 2 , the sets of random variables X-p 1 and X^ 2 
are independent. In this case Q is called the dependency graph. We say {Xi,i G [n]} satisfies 
(GD, m) if Q has maximum degree at most m — 1. 

Definition 3 (HD). Let {Yi,i G [N]} be a set of independent random variables taking values 
in E = Ei x . . . x Ejv, and for each i G [n], let Si be subsets of [N], and Xi : Ys f — > Aj be 
random variables depending on Ys v For each j G [N], let Rj := {i G [N]s.t.j G Si} (i.e. Rj 
the set of Xi depending ofYj, and Si is the set ofYj that Xi depends on). 

We say that {X iy i G [n]} satisfies (HD, k, I) if \Si\ < k and \Rj\ < I for every i G [n],j G 
[N]. LetQ = ([n],8) be an undirected graph with an edge between i and j if Xi andXj depend 
on some common (i.e. if Si H Sj ®). If Q has maximum degree at most m — 1, then we 
say {Xi : i G [n]} satisfies (HD, m). 

A relation between (HD, k, I) and (HD, m) is given by the following lemma (the proof is 
given in Section 4): 
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Lemma 1.1. Suppose that {Xi : i G [n]} satisfies (HD,m) for some {Yi,i G [N]}, then we 
can define [Y-,i G [N']} such that {Aj : i G [n]} satisfies (HD,m,m) for {Y(,i G [N'}}. 

Example 1.2 (m-dependence) . A simple example to illustrate the difference between (HD,k,l) 
and (HD,m) is the following: let Y±, . . . , Y n be independent random variables, and 

X\ := fi(Yi, . . . , Y m ), X2 := fzO^, ■ ■ ■ , Y m+ i), . . . , X n := f n (Y n , Yi, . . . , Y m -i). 

Then one can easily prove (by breaking Y into groups of size m ) that X±, . . . , X n satisfy (HD, 
2, 2m -I) and (HD,2m - I). 

Example 1.3 (Triangles in Erdos-Renyi graph). Let G(n,p) be an Erdos-Renyi graph, with 
edges (^j)i<j<j< n X\j is the indicator function of the edge between i andj), and denote 
by (7 1 jjfc)i<j<j<fc<n the indicator functions of the triangles between vertices i,j,k. Then one 
can easily see that (Tijk)i<i<j<k< n satisfies (HD,n — 2,3) and (HD,3{n — 3)). 

It is an easy exercise to prove that 

(HD, m) (GD, m) (LD1, m). 
The reverse implications are false in general. 

(LD1, m) does not imply (GD, m), as we can see from Example 1.1. 

(GD, m) does not imply (HD, m), we can see this from the example in (Burton, Goulet and Meester, 
1993) where they construct a one - dependent sequence (future independent of past) which 
only satisfies (GD, m) (the existence of such a sequence was an open question for many 
years) . 

It remains an open question whether (GD, m) implies concentration inequalities for general 
functions. At the moment, we do not know of practical applications that satisfy (GD, m), 
but not (HD, m). 

1.2. S elf -bounding and a- self -bounding functions 

Self bounding functions were introduced in (Boucheron, Lugosi and Massart, 2000), and 
found many applications. In (Boucheron, Lugosi and Massart, 2009), the authors introduce 
(a, b) self-bounding and weakly (a, b) self-bounding functions. 

For independent random variables, (a, b) self-bounding functions are a large class of func- 
tions, that contain Hamming Lipschitz functions, configuration functions, suprema of positive 
valued empirical processes. They also imply Talagrand's convex distance inequality. 

In (Paulin, 2012a), we have defined a stronger condition, a-(a, b) self-bounding and weakly 
a- (a, b) self-bounding functions, and shown that such functions satisfy concentration inequal- 
ities under some dependence condition. As we are going to see, they also satisfy concentration 
inequalities under the (HD,/c,/) dependence condition. 

The following definitions of self-bounding functions are from (Boucheron, Lugosi and Massart, 
2009) (we made a slight generalization, they had Ai = . . . = A n . = X). 

For 1 < i < n, and x G A, let X-i = (xi, . . . ,Xi-i,Xi+i, . . . ,x n ), and let A_j := Ai x ... x 
Aj_i x A i+1 x...xA n . 

Definition 4. A function g : A — > R is called (a, b) -self-bounding for some a, b > if there 
are functions gi : A_j — > E such that for all i = 1, . . . ,n and all x G A, 
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1. < g(x) - gi{x_i) < I, and 

2 - Er=i(fi , ( x ) _ 9i{ x -i)) < a 9( x ) + b. 

A function g : A — > R zs called weakly (a, b) -self-bounding if there are functions g$ : A_,; — >■ R 
swc/i t/iat for all x G A, 

n 
i=l 

Remark 1.1. If g is (a, b) -self-bounding, then it is also (a,b)-weakly-self-bounding. If g is 
(a, b) -self-bounding for some g i; then it is also (a, b) -self-bounding for 

:= inf /(xi, . . . . . . (1.3) 

//g is weakly (a,b)-self-bounding, then in this paper we will also assume that gi(x_i) < g{x) 
for all x G A, and in this case, we can choose g\ as in (1.3). In the rest of this paper, we 
assume that gi is chosen as (1.3). 

For these functions, the following concentration inequalities hold ((Boucheron, Lugosi and Massart, 
2009) supposed that Ai = . . . = A n = X, but the same results trivially hold for this case): 

Theorem 1.3. ((Boucheron, Lugosi and Massart, 2009)) Let 

X := (X ly . . . , X n ) 

be a vector of independent random variables, taking values in A and let f : A — > R be a 
non-negative measurable function such that Z = f(X) has finite mean. For a, b > 0, define 
c — (3a — l)/6. Iff is (a, b) -self-bounding, then for all A > , 

logE[e^)]< (aEZ + ^ 2 
2(1 — C-f A) 



t 2 



For all t>0, 

F{Z > EZ + t} < exp . 
1 ~ V 2 ( aEZ + b + M) J 

If f is weakly (a, b) -self-bounding and for all i < n, all x G A, fi(x^') < f(x), then for all 
< A < 2/a, 

logE[e^)]< (aEZ+ ^ 2 
& L J - 2(1 -aA/2) 

and for all t > 0, 

/ t 2 

¥{Z > EZ + t} < exp 



2(aEZ + b + at/2 J ' 

If f is weakly (a, b) -self -bounding and f(x) — fi(x^) < 1 for each i < n and x G A, then for 
< t < EZ, 

F{Z < EZ — t}< exp 



t 2 



2{aEZ + b + c_t) / ' 
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We define a- self-bounding functions as in (Paulin, 2012a): 
Definition 5. Let Q — fii x . . . x Q n . Let a, b > 0. 

1. We say that / : Q — > M. is a-(a, b) self-bounding if there is a : A — > such that 

(a) f(x) - f(y) < Ei<„ai( x )lki Vi] for every x,y G Q. 

(b) Ojj(x) < 1 for every i < n,x G Q. 

( C ) T, t <n a i( X ) < a f( X ) +°- 

2. We say that f : Q — >• R is weakly a-(a, b) self-bounding if there is a : A — >• such 
that 

(a) f(x) - f(y) < Ei<„aiO)lki ^ Vi\ f° r ever V x,y eQ. 

(b) E i <n^) 2 <^f( x ) + b- 

Remark 1.2. It is easy to see that a-(a,b) self-bounding functions are also weakly a-(a,b) 
self-bounding. 

Remark 1.3. The following relations hold: 

(a, b) -self-bounding =>- weakly (a, b)-self-bounding 

t t 
a-(a,b)-self-bounding =>- weakly a-(a,b) -self-bounding 

The reverse implications are false in general. 
2. Results 

The following theorem bounds the moment generating function of subadditive functions of 
(LD1) variables. 

Theorem 2.1. Suppose that A := M. n (or M. n ), and f : A — > R is a subadditive function, i.e. 
for any x,y G A, f(x + y) < f(x) + f(y). 

Let A = [n], and suppose that {Xi} ie ^ satisfy (LD1). Then 

• If {Af\j is one of the smallest proper covers of A, having x(A) elements, then for 
6>0,' 

1 x(A) 

E ( e «/(*i>-.*0) < _i_ E (e ex{A)f ( x ^)) , (2.1) 

here Xj^. G A is defined by replacing all the components of X with zeros outside of Aj. 

• Suppose that f also satisfies f(cx) < cf(x) for every < c < 1, x G A. Let {(Aj,Wj)}j 
be an exact fractional cover with w j = X*(>A)- Then for every 9 > 0, 

E ( e »/(^.-.^)) < - J—^^E ( e «*W(*>0) . (2.2) 
X { ) ■ 
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Our next theorem is about (HD,/c,/) dependent random variables. 

Theorem 2.2. Let X = (Xi, . . . ,X n ) be a A valued random vector, satisfying (HD,k,l) for 
some Y = (Y l7 . . . , Y N ). 

• If f : A — >■ R is a-(a,b) -self-bounding for some a, b > 0, then there is a function g 
such that f(X) = g(Y) almost surely, and the function h defined as h(y) := g(y)/l is 
[ka, jfe) -self-bounding. 

• If f : A — > R is weakly a-(a, b) -self-bounding for some a,b > 0, then there is a function 
g such that f(X) = g(Y) almost surely, and the function h defined as h{y) := g{y)/l 
is weakly (/ca, yfe) -self -bounding, and satisfies \h(z) — h(z')\ < 1 for any z,z' differing 
only in one coordinate. 

Let A := Ai x . . . x A„, we say that a function / : A — > K is c- weighted Hamming Lipschitz 
for some c G M™ if for any x,y G A only differing in coordinate i, \ f{x) — f{y)\ < Q. 

Corollary 2.1. Suppose that X = {X i: i G [n]} satisfies (HD,l,k), X G A, then for any 
c-weighted Hamming Lipschitz f : A — > R, we have for every X > 0, 

E (exp (X[f(X) - Ef(X)])) < exp , (2.3) 



and thus for every t > 

-2t 2 



F(f(X) - Ef(X) > t),¥(f(X) - Ef(X) < -t) < exp ( >-f ^ ra 2 ) . (2.4) 



Corollary 2.2. Suppose that X = {X i: i G [n]} satisfies (HD,l,k), then a version of Tala- 
grand's convex distance inequality holds: 

E ( exp (is« 4KS) )) £ n^W' (2 ' 5) 

and as a consequence, 

P(X G S)F (X G Si) < exp (-t 2 /(10A:Z)) . (2.6) 



3. Applications 

Random matrix models with dependent entries have been considered by several authors, see 
for example (Anderson and Zeitouni, 2008). A model where the entries are (LD1) type, and 
satisfy some additional condition, appears in (Schenker and Schulz-Baldes, 2005), and then 
was further developed in (Hofmann-Credner and Stolz, 2008). These results show asymptotic 
convergence of the eigenvalue distribution to circular law or the singular value distribution 
to Marchenko-Pastur law. In this paper, we prove some non-asymptotic results. In our first 
example, we show concentration for the upper tail of the norm of a random matrix with 
(LD1) entries: 
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3.1. Norm of a random matrix with (LD1) entries 

Theorem 3.1. Let M := (Xjj)x<ij< n be a Hermitian matrix with entries bounded by K in 
absolute value, and the upper diagonal entries (^i,j)i<i<j< n satisfying (LD1) with neighbor- 
hoods A = {Aij}i<i<j< n . Then 

F(\\M\\ > 3C X *(A)KV^ + t) < exp (- 3^^ ) (3-1) 

here C is the universal constant in (Latala, 2005). 

If M := (Xjj)x<j< n j<iv is a complex valued matrix, with entries satisfying (LD1) with 
neighborhoods A = {<Ai,j}i<i<j< n , then 

P (||M|| > C X *(A)K (v^+ VN+ <fn~N) +t)< exp (3.2) 



3.2. Eigenvalues of a random matrix with (HD,k,l) entries 

The following theorem is a generalization of Theorem 1 of (Alon, Krivelevich and Vu, 2002) 
to this setting. 

Theorem 3.2. Let M be a real valued random symmetric matrix with entries bounded by 
1, and the upper diagonal entries satisfying (HD,k,l). Let \\(M) > ... > A n (M) be the 
eigenvalues of M in decreasing order. For every positive integer 1 < s < n, the probability 
that A S (M) deviates from its median by more than t is at most 4 e -* 2 /(80s 2 -^ 77^ 

same 

estimate holds for the probability that A.„_ s+1 deviates from its median by more than t. 

Remark 3.1. We leave it to the reader as an exercise to adapt the proof of (Meckes, 2004), 
Theorem 2 to this setting, and reduce the s 2 in the exponent to s. 

Remark 3.2. The correct range of concentration of the eigenvalues of a Wigner matrix M n 
is O I y l ° s ^ J in the bulk, and O (n" 1 / 6 ) on the edge, as it is shown in (Dallaporta, 2012). 



4. Proofs 



Proof of Lemma 1.1. Given Yi, . . . , Yjv and Si, . . . ,S n , let us define S[, . . . , S' n the following 
way: for evey 1 < i < N, i £ S'a if and only if 1 < j < n is the smallest index such that 
i £ Sj. Now let Y[ := Yg> , ■ ■ ■ , Y£ := Ys> n , then the reader can easily verify that X±, . . . , X n 
satisfies (HD,m,m) for {Y-, i £ [n]}. □ 

Proof of Theorem 2.1. Part 1 is implied by the subadditivity of /, and the convexity of the 
exponential function. For 9 > 0, 

e 6f(x) < e e(f(x Al )+...+f(x Ax(A) )) = ^E^^) 

< 1 e ex(A)f(x Ai ) 
- x(A) ^ 
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Taking expectations gives the result. Part 2 is similar: for 9 > 0, 

< e «x*(-A)E i5 ^/(^) ^ e fr*W(*,0 

Taking expectations gives the result. We have used the f(cx) < cf(x) condition (for < c < 
1, since < Wi < 1 for exact covers). □ 

Proof of Theorem 2.2. We are only going to prove the first part (concerning a-self-bounding 
functions), the second part is similar. 

The existence of g such that f(X) = g(Y) is trivial, since X is a function of Y. For each 
1 < i < N, y e S, let 

hi(y-i) := inf h(yi, y^, y[, y i+1 , ...,y n ). 

Vi 

Using the fact that f(X) is a-(a,b) self-bounding, we can write 

N . N 

Y %) - h i(y-i) ^ y Y Y a o( x iv)) 

i=l i=l jeRi 



1 - k h 

■jk^2a 3 (x(y)) < j(af(x(y)) + b)< kh(y) + jb, 



< 

and thus the result follows. □ 

Proof of Corollary 2.1. This follows by a 4 times worse constant from the fact that / is 
weakly a-(0, X^=i self-bounding. We can get this better constant by writing f(X) = 
g(Y), and directly applying Mcdiarmid's bounded differences inequality to the independent 
variables Y±, . . . , Yjy. □ 

Proof of Corollary 2.2. The proof is similar to the proof of Corollary 1 of (Boucheron, Lugosi and Massart, 
2009). By Lemma 3.2 of (Paulin, 2012b), we know that d 2 T {x,S) is weakly a-(4,0) self- 
bounding. By Theorem 2.2 we have a function g with g(Y) = d^(X, S), and h(y) = g(y)/l 
is (4fc, 0) self-bounding. 

Thus, by Theorem 1.3, we have that for < A < ^k, 

logE(e A (^ (x ' 5) ~ E( ^ (x ' 5)) )) = logE(e Ai(/l(y) " E(/l(y))) ) 

4kl\ 2 E{d 2 T {X, S)) 
~ 2(1 - 2kl\) ' 

thus 

AklX 2 

logE(e^M) < A E(4(X, S)) + _ E(<%(X, S)). (4.1) 

Again by Theorem 1.3, 

logP(X e S) = logP(4(X, S) - Ed 2 T (X, S) < -Ed 2 T {X, S)) 

= io gP(M y) - ea(v) < -Eft(y)) < -f§ = -Mj£*>. 
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By adding this to (4.1) for A = -^g, we get the result. □ 

Before proving Theorem 3.1, we introduce some results that we are going to use: 

Theorem 4.1 (Theorem 2 of (Latala, 2005)). For any finite matrix of independent mean 
zero r.v. 's we have 



E| | (X t ,) 1 1 < C (max EX 2 + max ^ EX§ + 

Proposition 4.1. Lei Ay fre a symmetric real valued matrix with entries bounded by K . Let 
Xi(Aij) be its largest eigenvalue. If we look at Ai as a function of only A\<i<j< n , then Ai is 
weakly (0, 16K 2 ) self-bounding. 

Proof. This is a reformulation of Example of page 42-43 of (Lugosi, 2005) (but they made 
a mistake by treating all the elements of the symmetric matrix as independent random 
variables, so the correct constant is 4 times worse). 
For some v € R n , 

Xi(A) = sup u l Au = v l Av = y VjVjAij, 

u6R n :||«||=l i - 

and for A' created by replacing Ay and Ajj with Ay, we have 

Ai(A) - Ai(A') < 2v i v j (A itj - A[j) < AK\ Vi \\ Vj \, 
so the result follows by 

E(^NN) 2 < iex 2 (EH 2 ) (EN 2 ) ^ 1QR2 - 



□ 



In the following proposition, we prove a similar result for largest singular value: 

Proposition 4.2. Let A := (Ay) be an n x N sized complex valued matrix with entries 
bounded by K in absolute value. Let S\(A) be its largest singular value (which is equal to its 
operator norm). Then S\ is weakly (0,4X 2 ) self-bounding. 

If A := {Aij) is a Hermitian n x n matrix, then s i; as the function of only A\< 
weakly (0, 16K 2 ) self-bounding. 

Proof. We can write, for some U, V complex valued unit vectors, that 



si(A) = sup Re[u*Av] — sup Re[u*VjAjj] 

u,i;GR rl :||u||,||ti||=l u,v:||u||,||u||=l ■■ 

Let's denote by A' the matrix that we get by changing A^ to A^-, then one can easily see 
that 

Sl (A) - Sl (A') < RepfVjAtj] - Rep^VjA^} < 2#|C/ i ||^|, 
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and thus the first statement of the proposition is implied by 

I 2 ) < 4K 2 . 



ij \l<i<n / \l<j<N 



ij \l<i<n / \l<j'_ 

The proof of the last statement is similar, and is left to the reader. □ 

Proof of Theorem 3.1. In the Hermitian case, let A = R n ( n-1 )/ 2 . Let / : A — > R be the norm 
of the Hermitian matrix with upper diagonal elements as argument. Let A = [n(n — l)/2], 
and let (Ai, Wi) be an exact cover for A with £\ Wi = x*(A). 

Then f(cx) = cf(x) for < c < 1, so by the second part of Theorem 2.1, 

w e wi) < _L-V™,e (V^IK, 

v > - X *(A) V 

Now for each j, is a symmetric matrix with independent entries, and thus, by the 
second statement of Proposition 4.2, we know that \\Ma- \ \ is weakly (0, 16K 2 ) self-bounding 
as a function of its upper diagonal entries. This means that by Theorem 1.3, we have for 
every A > 0, for every j, 

E(e X W M ^W) <e AE IKill .e 8K2x \ 



which, by setting A := x*(A)9, implies that 

E(e«) < e ^V(^_i^^ u , ie «x*(>iH|^||. (4 . 2 ) 

X*{A) . 

Now, Theorem 4.1, combined with the boundedness assumption, implies that 

E \\M Aj \ \ < 3C^, 

thus by Markov's inequality, we get 

P (||M|| > 3C X *(A)KV^ + t) < exp (8K 2 X *(A) 2 9 2 - 9t) , 

and taking 9 = 16x ^ A ^2 gives the bound. 

The proof of the rectangular case is similar. □ 

Proof of Theorem 3.2. This is just a simple adaptation of the argument of (Alon, Krivelevich and Vu, 
2002). The version of Talagrand's inequality for (HD,k,l) dependence, which follows from 
Corollary 2.2, is of the form 

Pr[A}Pr[B] < e ~ t2/{im \ 
and thus the constant the exponent becomes 80kl (instead of 32). □ 
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